Corpus: eng-za_newscrawl_2013

2.2.5 Most frequent word beginnings

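The most frequent word beginnings, given as character n-grams for N = 1 to 5, together with a Zipf diagram. Each n-gram is listed with a trailing hyphen that marks it as a word beginning, so an entry such as non-- stands for the 4-gram "non-".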
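
As an illustration of how such lists can be produced, the following is a minimal sketch, not the tool that generated this page. It assumes the input is a plain list of words, that counting is case-sensitive (as the separate entries for "s-" and "S-" suggest), and that words shorter than N are skipped. The function name `top_word_beginnings` and the sample words are hypothetical. Whether the original counts run over distinct word forms or over running text is not stated here; the sketch simply counts every item in the list it is given.

```python
from collections import Counter


def top_word_beginnings(words, n, k=5):
    """Return the k most frequent n-character word beginnings.

    Assumptions (not confirmed by the source page):
    - counting is case-sensitive,
    - words shorter than n characters are ignored.
    """
    counts = Counter(w[:n] for w in words if len(w) >= n)
    return counts.most_common(k)


if __name__ == "__main__":
    # Hypothetical sample input; a real run would read the corpus word list.
    sample = ["The", "Then", "There", "Mars", "March", "the", "under", "understand"]
    for n in range(1, 6):
        # Append the trailing hyphen used on this page to mark a word beginning.
        rows = [(f"{gram}-", freq) for gram, freq in top_word_beginnings(sample, n)]
        print(n, rows)
```

Passing a deduplicated word list yields counts over word forms, while passing the running text yields counts over occurrences; the choice changes the ranking noticeably for very frequent words such as "The".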


Figure: Zipf's diagram for word beginnings (Gnuplot diagram)

Top Characters
rank frequency n-gram
1 11635 s-
2 11025 M-
3 10894 S-
4 8227 c-
5 7878 C-
Top Character Bigrams
rank frequency n-gram
1 4377 Ma-
2 3262 co-
3 3023 re-
4 2723 in-
5 1996 Th-
Top Character Trigrams
rank frequency n-gram
1 1331 The-
2 997 Mar-
3 985 con-
4 799 the-
5 793 com-
Top Character 4-Grams
rank frequency n-gram
1 1147 The-
2 569 the-
3 538 over-
4 435 non--
5 395 unde-
Top Character 5-Grams
rank frequency n-gram
1 370 under-
2 288 inter-
3 278 John-
4 258 self--
5 249 Chris-
Page generated in 1847 msec on 2018-02-24 15:22